Optimizing regression for in-car speech recognition using multiple distributed microphones

نویسندگان

  • Weifeng Li
  • Fumitada Itakura
  • Kazuya Takeda
چکیده

In this paper, we address issues in improving handsfree speech recognition performance in different car environments using multiple spatially distributed microphones. In previous work, we proposed multiple regression of the log-spectra (MRLS) for estimating the logspectra of speech at a close-talking microphone. In this paper, the idea is extended to nonlinear regressions. Isolated word recognition experiments under real car environments show that, compared to the nearest distant microphone, recognition accuracies could be improved by about 40% for very noisy driving conditions by using the optimizing regression method, The proposed approach outperforms linear regression methods and also outperforms adaptive beamformer by 8% and 3% respectively in terms of averaged recognition accuracies.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

In-car speech recognition using distributed microphones-adapting to automatically detected driving conditions

In this paper, we describe a multichannel method of noisy speech recognition that can adapt to various in-car noise situations during driving. The method allows us to estimate the log spectrum of speech at a close-talking microphone based on the multiple regression of the log spectra (MRLS) of noisy signals captured by multiple distributed microphones. Through clustering of the spatial noise di...

متن کامل

Multiple Regression of Log-spectra Fo

This paper describes a new multichannel method of noisy speech recognition, which estimates the log spectrum of speech at a close-talking microphone based on the multiple regression of the log spectra (MRLS) of noisy signals captured by the distributed microphones. Since the method does not assume the arrangement of sound sources and microphones, it can be applied to in-car speech recognition d...

متن کامل

Multiple regression of log-spectra for in-car speech recognition

This paper describes a new multi-channel method of noisy speech recognition, which estimates the log spectrum of speech at a closetalking microphone based on the multiple regression of the log spectra (MRLS) of noisy signals captured by distributed microphones. The advantages of the proposed method are as follows: 1) The method does not require a sensitive geometric layout, calibration of the s...

متن کامل

Detection of Local Disturbances and Simultaneously Active Speakers for Distributed Speaker-Dedicated Microphones in Cars

For automotive hands-free and speech recognition applications, distributed microphones are often mounted in the car where each of the speakers has a dedicated microphone close to his position. To provide additional control information for further speech enhancement, it is often advantageous to distinguish between the activity of the different passengers. In this contribution speaker activity is...

متن کامل

CENSREC2: corpus and evaluation environments for in car continuous digit speech recognition

This paper introduces a common database and an evaluation framework for connected digit speech recognition in real driving car environments, CENSREC-2, as an outcome of IPSJ-SIG SLP Noisy Speech Recognition Evaluation Working Group. Speech data of CENSREC-2 was collected using two microphones, a close-talking microphone and a hands-free microphone, under three car speeds and four car conditions...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004